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D script! n 

BACKGROUND OF THE INVENTION 

5 1 . Field of the Invention: 

The present invention relates to a novel protease which specifically cleaves the peptide bond at the 
carboxyl termini of glutamic acid residues in the amino acid sequences of polypeptides, a method for 
producing the protease from bacteria of the genus Bacillus , a DNA sequence encoding the protease, an 
10 expression vector containing the DNA sequence, a transformant obtained by introducing this expression 
vector into a host, and a method for producing the protease using the transformant. 

2. Description of the Prior Art: 

75 The V8 protease derived from the V8 strain of Staphylococcus aureus is already known as an enzyme 
which acts upon proteins (i.e., polypeptides) and specifically cleaves the peptide bond at the carboxyl 
terminal of glutamic acid (Glu) residues (Gabriel R. Drapeau et al., J. Biol. Chem. 247 , 20, 6720-6726, 
1972). This enzyme is classified as a serine protease. C. Carmona et al. have cloned the DNA sequence 
encoding this enzyme (Nucl. Acids Res., 15, 6757, 1987). 

20 A similar enzyme, an endopetidase which is specific for acidic amino acid residue and is derived from 
an actinomycete bacterium Streptomyces griseus , is also known (Norio Yoshida et al., J. Biochem. 104 , 3, 
451-456, 1988). Furthermore, an endoprotease which is specific for glutamic acid residue derived from 
Bacillus subtilis is also known (Takuro Niidome, Norio Yoshida, Fusahiro Ogata, Akio Ito, and Kosaku Noda, 
J. Biochem. 108 , 965-970, 1990); Abstracts of 62nd General Conference of the Japan Biochemical Society). 

25 The aforesaid enzymes are useful when specific cleavage of proteins at the aforesaid sites is desired 
for the purposes of protein structural analysis, etc., or when recombinant DNA techniques have been 
employed to produce a desired protein in the form of a certain fusion protein, from which the desired 
protein is to be obtained by cleavage. In the latter case, for example, when the desired protein has been 
produced in the form of a fusion protein in which the desired protein is linked with another protein via a 

30 glutamic acid residue, the desired protein can be separated by cleavage with such an enzyme. For these 
reasons, the availability of other proteases possessing this type of enzymatic activity, in addition to those 
mentioned above, would be highly desirable. 

SUMMARY OF THE INVENTION 

35 

The novel protease of this invention, which overcomes the above-discussed and numerous other 
disadvantages and deficiencies of the prior art, is derived from Bacillus licheniformis , The protease cleaves 
the peptide bonds at the carboxyl termini of glutamic acid residues in polypeptides, and contains an amino 
acid sequence from serine in the +1 position to glutamine in the + 222 position of SEQ ID NO: 1 . 
40 In a preferred embodiment, the protease is derived from Bacillus licheniformis ATCC No. 14580. 

In a preferred embodiment, the protease has the following properties: 

(1) Optimal pH: approximately 8.0, and 

(2) Stable pH range: pH 6.5-8.5 at 25 • C. 

In a preferred embodiment, the protease contains an amino acid sequence from serine in the +1 
45 position to glutamine in the + 222 position of SEQ ID NO: 1 , and cleaves the peptide bonds at the carboxyl 
termini of glutamic acid residues in polypeptides. 

The DNA sequence of this invention encodes the above-mentioned protease. 

In a preferred embodiment, the DNA sequence contains a base sequence from the thymine residue in 
the 605 position to the adenosine residue in the 1270 position of SEQ ID NO: 1. 
so In a preferred embodiment, the DNA sequence encodes a protease containing an amino acid sequence 
from N-formylmethionine at the -94 position to the glutamine at the + 222 position of SEQ ID NO: 1 . 

In a preferred embodiment, the DNA sequence contains a base sequence, from the thymine residue in 
the 323 position to the adenosine residue in the 1270 position, of SEQ ID NO: 1. 

The expression vector of this invention contains the above-mentioned DNA sequence. 
55 In a preferred embodiment, the expression vector is expressible in bacteria of the genus Bacillus . 

The transformant of this invention is obtainable by introducing the above-mentioned expression vector 
into a host. 

In a preferred embodiment, the host is a strain belonging to the genus Bacillus . 



3 



EP 0 482 879 B1 



The method for producing a protease of this invention comprises the steps of cultivating a strain of 
Bacillus licheniformis capable of producing the above-mentioned protease in a culture medium and 
recovering the produced protease from the culture medium. 

The method for producing a protease of this invention comprises the steps of cultivating the above- 
5 mentioned transformant in a culture medium and recovering the produced protease from the culture 
medium. 

It is understood that various other modifications will be apparent to and can be readily made by those 
skilled in the art without departing from the scope and spirit of this invention. Accordingly, it is not intended 
that the scope of the claims appended hereto be limited to the description as set forth herein, but rather 
70 that the claims be construed as encompassing all the features of patentable novelty that reside in the 
present invention, including ail features that would be treated as equivalents thereof by those skilled in the 
art to which this invention pertains. 

Thus, the invention described herein makes possible the objectives of: 

(1) providing a novel protease with an enzymatic activity of specifically cleaving polypeptides at the 
75 carboxyl termini of glutamic acid residues; and 

(2) providing a DNA sequence encoding the protease, an expression vector containing the DNA 
sequence, a transformant obtained by introduction of the expression vector into a host, and a method for 
the production of the protease using the transformant. 

20 BRIEF DESCRIPTION OF THE DRAWINGS 

This invention may be better understood and its numerous objects and advantages will become 
apparent to those skilled in the art by reference to the accompanying drawings as follows: 

Figures 1-1 to 1-3 show the DNA sequence of the protease of the present invention and the amino acid 
25 sequence deduced from the DNA sequence. 

Figure 2 is a schematic diagram illustrating the construction of the expression vector pHY300BLtt of the 
present invention. 

Figure 3 is a schematic diagram illustrating the construction of the shuttle vector pHY300PLKtt used in 
the construction of the expression vector pHY300BLtt of the present invention. 
30 Figure 4 shows graphs indicating the elution of the enzyme of the present invention from an affinity 
column in the process of extraction and purification of the enzyme from the medium in which Bacillus 
licheniformis ATCC No. 14580 was cultivated. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

35 

The present inventors have conducted various studies with a view to obtaining proteases, possessing 
the enzymatic action of cleaving peptides at the carboxyl termini of glutamic acid residues, from micro- 
organisms other than the aforesaid Staphylococcus aureus , etc. As a result of these researches, the present 
inventors have discovered a novel protease possessing the aforesaid property, derived from Bacillus 

40 licheniformis ATCC No. 14580. Furthermore, the present inventors have also found a DNA sequence 
encoding this protease and created an expression vector containing the DNA sequence as well as 
transformant obtained by introduction of the expression vector into a host, and discovered a method for the 
production of this protease using the transformant, thereby completing the present invention. 

The protease of the present invention (hereinafter referred to as BLase) is produced by bacteria of the 

45 genus Bacillus , in particular, by Bacillus licheniformis ATCC No. 14580. This strain is available from the 
American Type Culture Collection (ATCC). 

I. Culture conditions 

so No special medium is required for the cultivation of the aforesaid bacterial strain, and any of the various 
conventional types of culture medium are suitable for this purpose. For example, a medium containing 
glucose, soybean powder, meat extract, corn steep liquor, and the various inorganic salts, etc., can be used. 
The appropriate medium pH is 5-9, preferably approximately 7.0, the appropriate medium temperature is 
15-50'C, preferably approximately 28 *C, and the bacteria are cultured, for example, aerobically with 

55 stirring or shaking for approximately 36 hours. The enzyme BLase of the present invention was principally 
secreted extracellularly. 
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II. Collection of enzyme 

Known processes for the collection and purification of enzymes can be used, either singly or in 
combination, for the collection and purification of the present enzyme from the aforesaid culture broth. For 
example, the culture broth can be subjected to filter pressing, ultrafiltration, and centrifugal separation, 
thereby obtaining a cell-free liquid concentrate. The enzyme of the present invention can then be obtained 
from this concentrate by an appropriate method of purification. For example, the aforesaid concentrate can 
be subjected first to preliminary purification by ion exchange chromatography, and then to chromatography 
with S-Sepharose, and finally to affinity chromatography, thereby obtaining the present enzyme. In Example 
1 shown below, enzyme specimen with activity 1.9 x 10 3 to 2.4 x 10 3 U/mg (assayed by the method for the 
measurement of enzymatic activity described below) was obtained by this type of procedure. This enzyme 
specimen was used for the determination of enzyme properties described below. 

III. Method for the measurement of enzymatic activity 

Z-Phe Leu Glu-pNA (wherein Z is a carbobenzoxy group and pNA is a p-nitroaniline group), used as a 
substrate, is dissolved in 50 mM Tris-HCl (pH 7.5, containing 2 mM calcium chloride and 2% DMF) so as to 
achieve a final substrate concentration of 0.2 mM. An enzyme solution is added to this mixture, and a 
reaction is conducted at 37 *C for 10 minutes, then the 410 nm absorbance of the p-nitroaniline released 
into the liquid by the enzymatic reaction is measured. The enzymatic activity present when this absorbance 
is 1 .0 is defined as 1 unit (U). 

IV. Enzyme properties 

The enzymatic properties and protein chemical properties of BLase of the present invention are as 
follows. 

(1 ) Enzymatic action and substrate specificity 

(i) The synthetic substrates shows in Table 1 were prepared, and each of them was dissolved in 50 ml 
Tris-HCl (pH 7.5, containing 2 mM calcium chloride and dimethylformamide (DMF) or dimethyl sulfoxide 
(DMSO) in the proportions indicated by Table 1) so as to achieve the concentration shown in Table 1. 
Then, the present enzyme was added to this solution and a reaction was conducted at 25 *C. The 410 
nm absorbance of the p-nitroaniline released into the liquid by the enzymatic reaction was measured, 
and the quantity (nmol) of p-nitroaniline released from 1 mg of the substrate per minute was calculated; 
the results so obtained are shown in Table 1. 
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(ii) Oxidized insulin B chain was selected as a protein substrate, and the actions of the present enzyme 
and V8 protease derived from Staphylococcus aureus upon this substrate were compared by the 
following procedure. First, oxidized insulin B chain was dissolved in 50 mM ammonium bicarbonate (pH 
7.8), the present enzyme or the aforesaid V8 protease was added so as to achieve an enzyme/substrate 
ratio of 1/100 (W/W), and a reaction was conducted over a prescribed period of time. The reaction 



EP0 482 879 B1 



mixture was then subjected to HPLC using a 4.6 x 250 mm column packed with Vydac Protein C* (300 
angstroms), which was eluted under a 0-50% acetonitrile linear gradient in 0.1% TFA, raising the 
acetonitrile concentration by 1.67%/min. Peptide mapping revealed that, when either of the enzymes was 
used, the peptide bonds at the carboxyl termini of the glutamic acid residues were cleaved, and the 
s products of enzymatic hydrolysis induced by the two enzymes were identical with each other. 

Thus, the results of the aforesaid analyses (i) and (ii) demonstrated that the present enzyme cleaves 
peptide bonds at the carboxyl termini of glutamic acid residues, and is indeed a glutamic acid specific 
endopeptidase. 

io (2) Optimal pH and stable pH range 

Z-Phe Leu Glu-pNA as a substrate was dissolved in 50 mM Tris-HCI containing 10% DMF and 2 mM 
calcium chloride. Then, the present enzyme was added to this mixture, a reaction was conducted for 15 
minutes at 37 *C, and the 410 nm absorbance of the p-nitroaniline released into the liquid by the enzymatic 
75 reaction was measured. The aforesaid reaction was conducted at various pH values, and the results 
revealed that the optimal pH for enzymatic activity is 8.0. 

Next, the present enzyme was maintained at 25 • C for 24 hours at various pH values, and in each case 
the enzyme after this treatment was allowed to react with a substrate in accordance with the procedures 
described in the method for the measurement of enzymatic activity mentioned above. The results indicate 
20 that the stable pH range of the present enzyme is about 4.0-10.0. In a pH range of 6.5-8.5, the enzymatic 
activity is maintained at 100%, and in pH ranges exceeding 4.0 up to less than 6.5, and exceeding 8.5 up to 
10.0, the enzymatic activity is maintained at 80-100%. 

(3) Thermal stability 

25 

The present enzyme was maintained for 15 minutes at various temperatures in a buffer solution 
containing 2 mM calcium chloride at pH 7.8. In each case, the enzyme after this treatment was allowed to 
react with a substrate in accordance with the procedures described in the method for the measurement of 
enzymatic activity mentioned above. The results indicated that under the aforesaid conditions the present 
30 enzyme is stable at temperatures up to 60 *C. When the present enzyme was similarly kept in solutions not 
containing calcium chloride, it was stable at temperatures up to 50 * C. 

(4) Effect of inhibitors 

as The present enzyme is completely inhibited by diisopropyl fluorophosphate (DFP). This fact indicates 
that the present enzyme is classified as a serine protease. 

The present enzyme is also completely inhibited by Z-Phe Leu Glu CH2CI. This fact indicates that the 
present enzyme is a glutamic acid specific endopeptidase. 

The present enzyme is partially inhibited by EDTA, with a maximum inhibition ratio of approximately 
40 72%. This inhibitory effect of EDTA is completely nullified by the addition of metal ions at low concentra- 
tions (10~* to 10~ 3 M of calcium or magnesium ions, etc.). 

The aforesaid facts indicate that the present enzyme is a typical serine protease, the stability of which 
is related to the presence of metal ions. 

45 (5) Molecular weight 

The molecular weight of the present enzyme was determined by SDS-PAGE using 15% gel (1.0 mm) 
and Rainbow™ Protein Molecular Weight Marker (Amersham), and was calculated to be 26,000. The 
molecular weight was also calculated from the amino acid sequence determined on the basis of the gene 

50 sequencing analysis to be described below, and the value so obtained was 23,567 which is somewhat 
different from the aforesaid value obtained by SDS-PAGE. Nevertheless, the results of the various protein 
chemical analyses to be described below (amino acid composition, amino terminal sequences, amino acid 
composition in the vicinity of the carboxyl terminus) agreed well with the structure deduced from the DNA 
sequence. This indicates that the molecular weight obtained by SDS-PAGE was, in fact, slightly in excess of 

55 the true value. 
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(6) Isoelectric point 

Investigation of the isoelectric point of the present enzyme using the Pharmacia FAST System 
(Pharmalite, pH 3.0-10.0) yielded values above pH 9.0, and a normal value could not be obtained. 

(7) Amino acid composition 

Using 4 M methanesulfonic acid (containing 0.2% of 3-(2-aminoethyl)indole), the present enzyme was 
hydrolyzed at 1 1 0 • C for prescribed time intervals (24, 48, or 72 hours). The respective hydrolysates were 
then subjected to amino acid analysis using a Hitachi Model 835 amino acid analyzer. The results of this 
analysis, corrected for the decomposition of amino acids in the process of hydrolysis, are shown in Table 2. 
The amino acid composition calculated from the DNA sequence of the present enzyme (described below) 
are also shown for comparison in Table 2. The two sets of results clearly display good agreement. 
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(8) Partial amino acid sequences 

(i) Amino acid sequence near N terminus 

A Model 477A Protein Sequencer (Applied Biosystems) was used to analyze the amino acid 
s sequence of the present enzyme in the vicinity of the amino terminus. The enzyme samples used were 
inhibited beforehand with DFP. The amino acid sequence from the amino terminus to the 23rd residue is 
shown in Table 3. 
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<ii) Amino acid sequence near C terminus 
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Carboxypeptidase A (CPase A) or carboxypeptidase Y (CPase Y) was allowed to act upon samples 
of the present enzyme inhibited beforehand with DFP, and the quantities of amino acids released by 
these reactions were measured with an amino acid analyzer of Hitachi Model 835. However, the amino 
acid sequence in the vicinity of the carboxyl terminus of the present enzyme could not be accurately 
5 determined using either of the aforesaid carboxypeptidases. Nevertheless, the presence of glutamine, 
serine, alanine and asparagine in the vicinity of the carboxyt terminus was verified. 

V. Determination of DNA sequence encoding BLase 

70 Certain terminology employed in the specification of the present invention is defined as follows. 

"Oligonucleotide" refers to a short single-strand DNA molecule. Oligonucleotides can be chemically 
synthesized in accordance with known methods. Unless otherwise stated, the oligonucleotides used in the 
processes of the present invention are chemically synthesized, and are purified by gel chromatography 
using Sephadex G50 and high-performance liquid chromatography (HPLC) with a reverse phase silica gel 

75 column. 

"PCR" is an acronym of "polymerase chain reaction", and refers to a method for enzymatic amplifica- 
tion of a definite DNA region (Saiki et al., Science, 239, 487-497, 1988). First, the DNA to be amplified is 
converted to single-strand form by thermal denaturation, and oligonucleotide primers (two types, i.e., sense 
and antisense strands, each having a complemental sequence to the 3'-terminal region of the said single- 

20 stranded DNA) are annealed to the regions at the respective 3'-termini of the single-stranded DNA (i.e., the 
template DNA). Next, the extension of the DNA strands from the respective primers is accomplished by a 
reaction using DNA polymerase. By repeating this sequence of reactions, the target DNA can be amplified 
by a factor of 100,000 to 1 ,000,000. 

"Southern blotting" is a method for determining whether or not a specified gene is contained in a DNA 

25 fragment obtained by cleavage with a certain restriction enzyme. Southern blotting is performed by first 
digesting the DNA sample under investigation with a restriction enzyme which specifically recognizes a 
certain base sequence in duplex DNA and cleaves this DNA at specific sites. The digest so obtained is 
subjected to 1% agarose gel electrophoresis, then denatured into single-stranded DNA by alkali treatment, 
and transferred to a nylon filter. Separately, an oligonucleotide or DNA fragment constituting a portion of the 

30 gene in question is prepared and labelled to obtain a probe. Hybridization of the single-stranded DNA on 
the nylon filter with this probe is then used to detect the presence of the gene in question. 

"Ligation" refers to the creation of phosphodiester bond between two duplex DNA fragments. In this 
technique, in order to prevent the self-ligation of the duplex DNA fragments, one of the fragments is 
subjected to prior dephosphorylation treatment by the conventional method (T. Maniatis et al., "Molecular 

35 Cloning", 133-134, 1982). Ligation can be accomplished with T4 DNA ligase, using a well known type of 
buffer solution and reaction conditions. 

"Transformation" refers to the phenomenon wherein the genotype of a cell (i.e., a host cell) is 
transformed by the introduction of exogenous genes (DNA) into the said cell. A cell which has undergone 
such a transformation is known as a "transformant", and is characterized by the capability for replication of 

ao the exogenous DNA either as a extranuclear component or in a form integrated into the chromosomes of 
the said cell. 

Next, the method employed for the determination of the DNA sequence of BLase of the present 
invention will be described in the order of the processes involved. This DNA sequence was determined by 
the analysis of the genome DNA of the Bacillus licheniformis ATCC No. 14580 using a combination of PCR 
45 analysis, Southern blotting, direct sequencing techniques, etc. 

(1 ) PCR analysis of genome DNA sequence 

A DNA sequence encoding BLase can be obtained, for example, from genome DNA. In order to obtain 
so the DNA, first, the genome DNA of Bacillus licheniformis ATCC No. 14580 is isolated from cultured cells of 
the said strain by the conventional technique (M. Stahl et al., J. Bacteriology, 154 , 406-412, 1983). This 
genome DNA is used as the template DNA for PCR analysis. The oligonucleotide primers used for PCR are 
synthesized by conventional methods on the basis of the amino acid sequence in the vicinity of the amino 
terminus of the purified enzyme, determined in Section IV Item (8) above, and/or the amino acid sequences 
55 of the peptides obtained by partial hydrolysis of the said enzyme. For example, the oligonucleotide 
encoding the amino acid sequence Thr Asn Thr Thr Ala Tyr Pro Tyr which corresponds to the 12th through 
19th positions reckoned from the amino terminus of BLase (see Table 3) is used as sense primer BL8 
(shown by SEQ ID NO: 2). This oligonucleotide is a tricosamer which encodes the amino acid sequence 
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upto the second base of the triplet codon for the tyrosine residue of c-terminus. 

T T T A 

5 BL8: 5 • - AC AACAC AC GCTTACCC TA 

C C C G 

Separately, another oligonucleotide primer is synthesized on the basis of a peptide which is obtained by the 
io decomposition of the purified enzyme with lysylendopeptidase followed by sequencing and the sequence of 
which is most reliable. As described in Example 2 below, the sequence Gly Tyr Pro Gly Asp Lys (SEQ ID 
NO: 8) is obtained, hence, the octadecamer complementary to an oligonucleotide encoding this amino acid 
sequence is used as antisense primer BL83 (shown by SEQ ID NO: 3). 

75 T A T C T 

BL83: 5' - TT TC CC GGATA CC 
C G G A G 

20 

Then, PCR is performed using the aforesaid genome DNA, the sense primer BL8, and the antisense primer 
BL83, thereby extending and amplifying the target DNA strands in the genome DNA. The PCR products so 
obtained are subjected to agarose gel electrophoresis, thereby obtaining a DNA fragment of approximately 
370 bp. This DNA fragment is incorporated into a suitable vector, and after subcloning, the base sequence 
25 of the fragment is determined by the Sanger technique. The aforesaid amino acid sequence Gly Tyr Pro 
Gly Asp Lys which constituted the basis for the preparation of the antisense primer BL83 was identified as 
that located in positions 131-136 in the amino acid sequence of BLase. 

(2) Southern blotting analysis of genome DNA 

30 

The genome DNA derived from the Bacillus licheniformis ATCC No. 14580, prepared in Item (1) above, 
is digested with the restriction enzyme Sail, and after separation by agarose gel electrophoresis, the DNA 
fragments so obtained are blotted onto a nylon membrane filter, and analyzed by the Southern technique. 
The probe used for hybridization is the BL8-BL83 PCR product obtained in Item (1) above, labelled with 
35 ^P-dCTP by the conventional method. The DNA fragment which displays positive hybridization to this BL8- 
BL83 product is recognized as a band corresponding to a length of approximately 3.1 kb. 

(3) Sequencing of genome DNA by PCR 

40 The genome DNA of the Bacillus licheniformis ATCC No. 14580, obtained in Item (1) above, is digested 
with Sai l, then this digest is incorporated into a suitable vector, for example, pUC119 vector, and a PCR is 
conducted using a portion of the known DNA sequence as a primer. For example, a portion of the DNA 
sequence of pUC119 located upstream to the aforesaid genome DNA is used as sense primer RV (shown 
by SEQ ID NO: 4), and a DNA sequence complementary to a sequence in the vicinity of the 3* terminus of 

45 the 375 bp DNA fragment analyzed in Item (2) above is used as antisense primer B125 (shown by SEQ ID 
NO: 5). 

RV: 5* - CAGGAAACAGCTATGAC 

B125: 5' - TGTCCCAACAAGTGATGA 
A DNA fragment of approximately 1050 bp is obtained by the PCR. The base sequence of this fragment 
so can be determined by the direct DNA sequencing method (Gibbs et a!., Pro. Natl. Acad. Sci. U.S.A., 86, 
1919-1923, 1989). In this manner, the DNA sequence encoding BLase can be ascertained from the amino 
terminus up to the middle portion of the sequence. 

Next, the portion of the sequence on the 3' side of the genome DNA can be determined by the 
following procedure. First, in the same manner as indicated above, the genome DNA of Bacillus lichenifor- 
55 mis ATCC No. 14580 is digested with Sail, and a fragment of approximately 3.1 kb is isolated. This is 
inserted into M13mp11, and a PCR is conducted. The primers used for this PCR are partial fragments of 
the 375 bp DNA fragment analyzed in Item (2) above; one is sense primer B40 (shown by SEQ ID NO: 6) 
which is located upstream to the aforesaid antisense primer B125, and the other is antisense primer M4 
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(shown by SEQ ID NO: 7) which has a DNA sequence complementary to a portion of the DNA sequence of 
M13mp11, and is located downstream to the genome DNA. 
B40: 5* - AAAACCGTCGCAACAGCC 
M4: 5* - GTTTTCCCAGTCACGAC 
The aforesaid PCR yields a DNA fragment of approximately 2.2 kb. The base sequence of this DNA 
fragment can be determined by the direct DNA sequencing method. In this manner, the base sequence 
from the 3' terminus to the middle portion of the genome DNA is determined. 

The complete DNA sequence of BLase determined in this manner as well as the amino acid sequence 
determined from this DNA sequence are shown in SEQ ID NO: 1 and Figure 1. From SEQ ID NO: 1 and 
Figure 1, it is recognized that the gene encoding the mature protein derived from Bacillus licheniformis 
contains a DNA sequence encoding a signal peptide composed of the 94 amino acid residues from N- 
formylmethionione residue in the -94 position to the lysine residue in the -1 position, and a DNA sequence 
encoding the mature protein composed of the 222 amino acid residues from the serine residue in the + 1 
position to the glutamine residue at the +222 position. Ordinarily, ATG codes for methionine, but in this 
case TTG (fMet) appears to be the translation start codon. In the 332 bp segment of the 5 f untranslated 
region starting from the Sail cleavage site, there are a promoter region containing a -35 sequence, a 
Pribnow box, and a Shine-Dalgarno sequence which is present 9 bases upstream from the aforesaid 
inferred translation start codon TTG. In the 3* untranslated region, an inverted complementary repeat 
composed of 13 base pairs is located 8 bases downstream from the stop codon TAA. 

VI. Construction of expression vectors 

As shown in Figure 2, pHY300BLtt, an example of the expression vectors of the present invention, is 
obtained from the shuttle vector pHY300PLKtt, which contains an alkaline protease terminator derived from 
25 Bacillus subtilis ATCC No. 6051, by inserting a DNA fragment encoding BLase of the present invention 
shown in SEQ ID NO: 1 (i.e., a DNA fragment containing a promoter, a DNA sequence encoding a signal 
peptide, a DNA sequence encoding the mature peptide of BLase, and a terminator) into the said vector 
pHY300PLKtt. As shown in Figure 3, the aforesaid vector pHY300PLKtt is obtained from a vector 
pHY300PLK which is a shuttle vector of E. coli and B. subtilis , by inserting an alkaline protease terminator 
30 derived from Bacillus subtilis ATCC No. 6051 into the vector pHY300PLK. 

The aforesaid procedure will now be further explained in the order of the processes involved. First, as 
shown in Figure 3, genome DNA is isolated from the Bacillus subtilis ATCC No. 6051 by the method of M. 
Stahl et al. (supra.), and this is employed as template DNA. Next, a fragment composed of a DNA sequence 
corresponding to the vicinity of the 5' terminus of the terminator portion of the alkaline protease gene 
35 derived from the Bacillus subtilis 1-1 68, with an added Xba l cleavage site, and a fragment complementary to 
a DNA sequence corresponding to the vicinity of the 3* terminus of the terminator portion, with an added 
Hind II I cleavage site, are synthesized chemically, and a PCR is conducted using these fragments as 
primers. The DNA fragment so obtained is then cleaved with Xba l and Hindlll, thereby obtaining a fragment 

(1) shown in Figure 3. Next, pHY300PLK is cleaved with Xba l and Hindlll, thereby obtaining the larger 
40 fragment (2) shown in Figure 3. The shuttle vector pHY300PLKtt, containing the alkaline protease terminator 

derived from Bacillus subtilis ATCC No. 6051, is then constructed by the ligation of these fragments (1) and 

(2) . 

Next, genome DNA is isolated from cultured cells derived from Bacillus licheniformis ATCC No. 14580 
and used as template DNA. Then, a fragment composed of a DNA sequence corresponding to the vicinity 

45 of the 5' terminus of this template DNA with an added Eco RI cleavage site and a fragment complementary 
to the DNA sequence corresponding to the 3' terminus of the template DNA with an added Xba l cleavage 
site are synthesized and used as the sense and antisense primers, respectively. A PCR is conducted using 
the aforesaid template DNA, sense primer, and antisense primer. Then, the fragment so obtained is cleaved 
with Eco RI and Xbal, thereby obtaining a DNA fragment (3) encoding BLase (see Figure 2). This fragment 

so (3) contains a promoter, a DNA sequence encoding a signal peptide, a DNA sequence encoding mature 
BLase, and a terminator. Next, the aforesaid vector pHY300PLKtt is cleaved with Eco RI and Xba l, thereby 
obtaining the larger fragment (4). The expression vector pHY300BLtt of the present invention is then 
obtained by ligating the aforesaid fragments (3) and (4) (see Figure 2). 

This expression vector pHY300BLtt contains, under the control of the BLase promoter, a DNA sequence 

55 encoding the signal peptide from the N-formylmethionine residue in the -94 position to the lysine residue in 
the -1 position; a DNA sequence encoding a mature peptide extending from the serine residue in the + 1 
position to the glutamine residue in the +222 position of BLase; and a 3' untranslated region comprising a 
terminator. Still further downstream, the terminator of the alkaline protease derived from Bacillus subtilis 



10 



75 



12 



EP 0 482 879 B1 



ATCC No. 6051 is present. 

VII. Preparation of transformants and production of BLase 

5 The expression vector obtained in Section VI above is introduced into suitable host cells by a 

conventional method. For example, the aforesaid vector pHY300BLtt can be introduced into Bacillus subtilis 
ISW1214 (Takara Shuzo) by the method of J. Spiezen et al. (Proc. Natl. Acad. Sci. U.S.A. 44, 1072, 1958). 
The transformant ( Bacillus subtilis pHY300BLtt/ISW1214) is cultivated in any medium suitable for the host, 
thereby producing BLase of the present invention. Finally, BLase is isolated from the culture broth wherein 

w the transformant has been grown and purified by the process described in Section II above. 

EXAMPLES 

The present invention will now be further explained with reference to the specific examples. 

75 

Example 1 

Bacillus licheniformis ATCC No. 14580 was cultivated at 28 *C for 36 hours in a medium of pH 7.0 
containing 2.0% of glucose, 2.0% of soybean meal, 0.25% of corn steep liquor, 0.5% of ammonium sulfate, 

20 0.05% of dipotassium hydrogen phosphate, 0.05% of magnesium sulfate heptahydrate, 0.01% of ferrous 
sulfate heptahydrate, and 0.3% of calcium carbonate. Ninety five liters of the culture broth were filter 
pressed, and concentrated to approximately 14 liters by means of an ultrafiltration module (Nitto Ultrafil- 
tration Module NTU 2020T P18B (HF); cutoff MW 20,000) and a centrifuge (4200 rpm, 30 minutes). This 
concentrated cell-free broth was diluted to approximately 28 liters (1.90 ms/cm) with 2 mM calcium chloride. 

25 Then the pH of the diluted cell-free broth was adjusted to 6.0 by addition of hydrochloric acid. To this was 
added approximately 4 liters of Amberlite CG-50 which had been equilibrated with a 10 mM acetate buffer 
(pH 6.0) containing 2 mM calcium chloride, and the mixture was agitated for 4 hours at room temperature. 
After verifying that the supernatant had no BLase activity, the supernatant was discarded. Then the 
Amberlite CG-50 was packed into a 14 x 32 cm glass column. After washing with approximately 10 liters of 

30 10 mM acetate buffer (pH 6.0) containing 2 mM calcium chloride, elution was performed with 0.5 M sodium 
acetate buffer (pH 8.5) also containing 2 mM calcium chloride. 

The fractions having BLase activity eluted from the Amberlite CG-50 were combined (the total volume 
of the fractions was 2.7 liters) and dialyzed against water for 48 hours. The dialyzate was diluted to 8 liters 
(2.23 ms/cm) with 2 mM calcium chloride, and after adjustment to pH 6.0, the fluid was adsorbed onto 

35 approximately 800 ml of S-Sepharose, packed in a 5 x 40 cm column, which had been equilibrated 
beforehand with a 5 mM acetate buffer solution (pH 6.0) containing 2 mM calcium chloride. After washing 
with approximately 5 liters of buffer solution with the same composition as that used for the above- 
mentioned equilibration, the column was subjected to elution with 7 liters of this buffer solution under a 
linear gradient of 0-0.2 M sodium chloride. The fractions having BLase activity were combined (the total 

40 volume of the fractions was 900 ml), and dialyzed overnight against a 2 mM aqueous solution of calcium 
chloride, thereby obtaining approximately 950 ml of dialyzate (0.86 ms/cm). The pH of this dialyzate was 
adjusted to 7.5, then promptly subjected to affinity chromatography. The carrier used in this affinity 
chromatography process was approximately 340 ml of CH Sepharose 4B (Phe Leu-D-GluOMe) packed into 
a 3 x 48 cm column, and equilibrated with 5 mM Tris-HCI (pH 7.5) containing 2 mM calcium chloride. After 

45 adsorbing the aforesaid dialyzate in this column, the column was washed with approximately 5 liters of 
buffer solution with the same composition as that used for equilibration of the column, and then subjected to 
elution with 3.5 liters of the buffer solution of the same composition under a linear gradient of 0-0.7 M 
sodium chloride. 

The BLase activity of each fraction so obtained was measured by the method for the measurement of 
60 enzymatic activity shown in Section III of Description of the preferred Embodiments. The results are shown 
in Figure 4. The 280 nm absorbance of each fraction was measured as an index of protein concentration, 
and the results so obtained are also shown in Figure 4. This figure shows that the BLase was eluted at a 
concentration of approximately 0.5 M of sodium chloride. The BLase so obtained displayed a single band in 
SDS-PAGE. In this manner, an 833.1 mg specimen of the said enzyme (quantitated with a Bio Rad protein 
55 assay kit), with specific activity of 1.9 x 10 3 - 2 x 10 3 U/mg, was obtained from 95 liters of the culture broth. 
The yield of the enzymatic activity was 27.5%. 
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Example 2 

Determination of base sequence of DNA encoding BLase 

5 (1 ) PCR analysts of internal base sequence of genome DNA 

One hundred micrograms of the purified BLase obtained in Example 1 which had been treated with 
DFP was added in 1 50 ul of 0.05 M Tris-HCI (pH 9.0) containing 1 M urea, and digested with 1 ug of lysyl 
endopeptidase (Wako Pure Chemical Industries, Ltd.) at 37 *C for 5 hours. The resulting enzyme digest was 

70 then isolated and purified by high-performance liquid chromatography using a column packed with TSKgel 
ODS-120T (4.6 x 250 mm, Tosoh Co. Ltd.) The amino acid sequences of the digested fragments so 
obtained were investigated with a Model 477A Protein Sequencer (Applied Biosystems), thereby determin- 
ing the amino acid sequences of five types of fragments. Three of these sequences are indicated below as 
well as in SEQ ID NOS: 8, 9, and 10. 

75 Gly-Tyr-Pro-Gly-Asp-Lys (I) 

Ala-lle-Val-His-lle (II) 
Ser-Thr-Arg-Tyr-Phe-lle-Pro-Ser (III) 
Next, genome DNA was isolated from Bacillus licheniformis ATCC No. 14580 by the method of M. Stahl et 
al. (supra.), and the DNA was used as a template for PCR analysis. The oligonucleotide primers used for 

20 the PCR were prepared on the basis of known portions of the amino acid sequence of the BLase produced 
by Bacillus licheniformis ATCC No. 14580. First, an oligonucleotide encoding the amino acid sequence 
beginning with the 12th position and terminating with the 19th position from the amino terminus of the 
BLase molecule (see Table 3), that is, Thr Asn Thr Thr Ala Tyr Pro Tyr (except that the said oligonucleotide 
only extends through the second base of the triplet coding for the tyrosine residue at the carboxyl terminal 

25 of the oligopeptide, i.e., the said oligonucleotide is a tricosamer) was synthesized chemically. This was used 
as sense primer BL8 (shown by SEQ ID NO: 2). 

• T T T A 

30 BL8: 5 ' - AC AACAC AC GCTTACCC TA 

C C C G 

Next, an octadecamer complimentary to the DNA sequence encoding an amino acid fragment which is a 
35 product obtained in the aforesaid lysyl endopeptidase digestion and which has an amino acid sequence 
with the greatest degree of reliability [i.e., Gly Tyr Pro Gly Asp Lys (shown by SEQ ID NO: 8] was 
synthesized chemically. This was used as antisense primer BL83 (shown by SEQ ID NO: 3). 

40 T A T C T 

BL83: 5' - TT TCCC GGATA CC 
C G G A G 

45 Using the aforesaid template DNA and oligonucleotide primers, the DNA was amplified by the PCR 
method (Saiki et al., Science 239 , 487-491 , 1989). A portion of the amplified products were analyzed by 1% 
agarose gel electrophoresis, thereby confirming the presence of an approximately 370 bp DNA fragment. 
This fragment was isolated and, after blunting the ends with Klenow fragment, the fragment was cloned in 
M13mp11 which had been digested with Sma l, and the DNA sequence of the fragment was determined by 

so the Sanger method (Sanger et al., Proc. Natl. Acad. Sci. U.S.A., 74, 5463-5467, 1977). In this manner, the 
base sequence of a 375 bp fragment was determined, and the amino acid sequence corresponding to BL83 
was found to be located at positions 131 through 136. Furthermore, the aforesaid amino acid sequences (II) 
and (III) were found to be located at positions 21 through 25 and 79 through 86, respectively. 

55 (2) Southern blotting analysis of genome DNA 

The genome DNA derived from Bacillus licheniformis ATCC No. 14580 obtained by the procedure 
described in Item (1) above was digested with the restriction enzyme Sail, and after separation of the 
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products by 1% agarose gel electrophoresis, the DNA fragments were blotted onto a nylon membrane filter 
and analyzed by the Southern technique. The probe used for the hybridization was the BL8-BL83 PCR 
product obtained in Item (1) above, and labelled with ^P-dCTP by the conventional method. The DNA 
fragment which displayed positive hybridization to this BL8-BL83 product was recognized as a band 
5 corresponding to a length of approximately 3.1 kb. 

(3) Determination of genome DNA sequence by PCR 

The genome DNA of Bacillus licheniformis ATCC No. 14580, obtained in Item (1) above, was digested 

10 with Sail, then the ends of the fragments were blunted with T4 DNA polymerase. This blunt-end fragment 
was ligated with a dephosphorylated Sma l-digested pUCH9 vector. This ligation reaction was performed 
with a commercial kit (Takara Shuzo). 

Next, the sense primer RV (shown by SEQ ID NO: 4) and the antisense primer B125 (shown by SEQ ID 
NO: 5), with the base sequences also indicated below, were synthesized chemically and added to an aliquot 

75 of the reaction mixture for the aforesaid ligation reaction, allowing a PCR reaction to be carried out. 

The sense primer RV is a portion of the DNA sequence of pUC119 and is located upstream to the 
aforesaid genome DNA, while the antisense primer B125 is a DNA sequence complementary to a sequence 
in the vicinity of the 3' terminus of the 375 bp DNA fragment analyzed in Item (2) above. 
RV: 5' - CAGGAAACAGCTATGAC 

20 B125: 5' - TGTCCCAACAAGTGATGA 

A DNA fragment of approximately 1050 bp was obtained by the aforesaid PCR. The base sequence of this 
fragment was determined by the direct DNA sequencing method (Gibbs et al., Pro. Natl. Acad. Sci. U.S.A., 
86 , 1919-1923, 1989). In this manner, the DNA sequence encoding the present enzyme was ascertained 
from the 5' terminus up to the middle portion of the sequence. 

25 The genome DNA of the Bacillus licheniformis ATCC No. 14580, obtained in Item (1) above, was 
digested with Sail and subjected to 1% agarose gel electrophoresis, thereby isolating an approximately 3.1 
kb fragment. This fragment was blunt ended with Klenow fragment, and was ligated with dephosphorylated 
Smal -digested fragments of M13mp11. Using this as a template, a PCR was conducted. The primers used 
for this PCR were sense primer B40 (shown by SEQ ID NO: 6) which is a portion of the 375 bp DNA 

30 fragment analyzed in Item (2) above (upstream to the aforesaid primer B125), and antisense primer M4 
(shown by SEQ ID NO: 7) which is a DNA sequence complementary to a portion of the DNA sequence of 
M13mp11 located downstream to the genome DNA. 
M4: 5* - GTTTTCCCAGTCACGAC 
B40: 5' - AAAACCGTCGCAACAGCC 

as The aforesaid PCR yielded an approximately 2.2 kb DNA fragment. The base sequence of this DNA 
fragment was determined by the direct DNA sequencing method. In this manner, the base sequence from 
the 3' end to the middle portion of the genome DNA was determined. 

The complete DNA sequence of BLase determined in this manner as well as the amino acid sequence 
deduced from the DNA sequence are shown by SEQ ID NO: 1 and Figures 1-1 to 1-3. 

40 The DNA sequence of BLase, as well as DNA sequences which hybridize to the said DNA sequence 
are also useful for producing a protease with BLase activity which is also within the scope of the present 
invention. The DNA sequence which hybridizes to the DNA sequence of BLase can be obtained, for 
example, by the following process. 

Various DNA fragments, for example, DNA fragments derived from various organisms are screened by 

45 the use of whole or a part of the DNA sequence of BLase, e.g. a 1124 bp DNA fragment which is from A in 
the 248 position to T in the 1371 position of SEQ ID NO:1, as a probe. For example, the Southern 
hybridization technique (Southern, E. M„ J. Mol. Biol. 98, 503-517, 1975) is employed by the use of the M p- 
labelled probe, and a hybridization buffer having the following composition. 
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NaH 2 P0 4 (pH 7.2) 
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After the hybridization is carried out at 65 "C overnight, a filter, to which the probe has been hybridized, 
is washed 4 times at room temperature with 2 x SSC, 0.1% SDS, four times each wash being carried out 
for 10 minutes, at room temperature, thus obtaining a DNA fragment which has about 65% homology with 
the DNA sequence of BLase. When the filter is washed once for 20 minutes at 50 *C, a DNA fragment 
5 which has about 80% homology with the DNA sequence of BLase can be obtained. 

(4) Construction of expression vector 

(4)-1 Construction of shuttle vector pHY300PLKtt 

10 

A genome DNA was isolated from Bacillus subtilis ATCC No. 6051 by the method of M. Stahl et at. 
(supra.), and this was employed as a template DNA. Next, as primers, a fragment composed of a DNA 
sequence corresponding to the vicinity of the 5' terminus of the terminator portion of an alkaline protease 
gene derived from Bacillus subtilis 1-168 (Journal of Bacteriology 158 , 411-418, 1984), with an added Xbal 
15 cleavage site (sense primer A, shown by SEQ ID NO: 11), and a fragment complementary to a DNA 
sequence corresponding to the vicinity of the 3' terminus of the terminator portion, with an added Hindlll 
cleavage site (antisense primer B, shown by SEQ ID NO: 12), were synthesized chemically. 

Sense primer A: 

5' - GAGTCTAGAGC AG CTGC AC AATAATAG -3' 
20 Antisense primer B: 

5' - G AG AAG CTTG AC AG AG AACAG AG AAG -3' 
Then, a PCR was conducted using the aforesaid template DNA and these two primers. The DNA fragment 
so obtained was then cleaved with Xba l and Hindlll, thereby obtaining fragment (1) shown in Figure 3. Next, 
the shuttle vector pHY300PLK (Takara Shuzo) was cleaved with Xba l and Hindlll, thereby obtaining the 
25 larger fragment (2) (see Figure 3). A shuttle vector pHY300PLKtt, containing the alkaline protease terminator 
derived from Bacillus subtilis ATCC No. 6051, was then constructed by ligation of these fragments (1) and 
(2). 

(4)-2 Construction of expression vector pHY300BLtt 

30 

A genome DNA was isolated from cultured cells of Bacillus licheniformis ATCC No. 14580 by the 
method of M. Stahl et al. (supra.) and used as a template DNA. Then, a single-stranded DNA fragment 
corresponding to the vicinity of the 5' terminus of this template DNA with an added Eco Rl cleavage site and 
a single-stranded DNA fragment complementary to a DNA sequence corresponding to the 3* terminus of 
35 the template DNA with an added Xba l cleavage site were synthesized and used as sense primer C (shown 
by SEQ ID NO: 13) and antisense primer D (shown by SEQ ID NO: 14), respectively. 
Sense primer C: 

5'- CAAGAATTCGGCTTCCCGTGCGCCTCC - 3' 
Antisense primer D: 

40 5'- TTGTCTAGAATTTGCCGATCAGCGGTC - 3' 

A PCR was conducted using the aforesaid template DNA, sense primer C, and antisense primer D. Then, 
the fragment so obtained was cleaved with Eco Rl and Xba l, thereby obtaining a DNA fragment (3) encoding 
BLase (see Figure 2). Next, the aforesaid vector pHY300PLKtt was cleaved with Eco Rl and Xba l, thereby 
obtaining the larger fragment (4). The aforesaid fragments (3) and (4) were then ligated with T4 DNA ligase. 

45 The ligation mixture was used to transform E. coli K-12 C600. The transformants were cultivated on an agar 
plate medium containing ampicillin, and the ampillicin-resistant colonies were selected. Then, plasmid DNA 
was isolated from the cells of the selected colonies, and the insertion of the aforesaid DNA fragment (3) in 
the correct direction was verified from restriction enzyme cleavage patterns. 

so (5) Preparation of transformants and production of BLase 

The expression vector pHY300BLtt obtained in Item (4) above was introduced into Bacillus subtilis 
ISW1214 (Takara Shuzo) by the method of J. Spizien et al. (Proc. Natl. Acad. Sci. U.S.A. 44, 1072 (1958)). 
The bacteria were then cultivated on a plated agar medium containing tetracycline, and the tetracycline- 
55 resistant colonies were selected, thus obtaining the desired transformant ( Bacillus subtilis 
pHY300BLtt/ISW1214). 

The transformant was transplanted to 5 ml of LB medium (10 g trypton, 5 g yeast extract, and 5 g 
sodium chloride with water added to yield a total volume of 1 liter; pH 7.2) and shake-cultured at 37 *C for 
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18 hours. Then, 1 ml of this culture broth was added to 10 ml of Sc + medium which had been sterilized in 
an autoclave at 120 *C for 20 minutes, and shake-cultured was carried out at 28 *C. 



Composition of Sc medium 



Soluble starch 
Glycerol 
Bacto soytone 
CSL 

Yeast extract 
Calcium carbonate 



lOg 
5g. 
5g 

2. 5g 

ig 
3g 



Water is added so that the total 
& volume should be 1 L (pH 7.0) 

The aforesaid culture broth was centrifuged at 2500 x g for 5 minutes, and the supernatant was 
obtained. The BLase activity of this supernatant was measured by the method for the measurement of 
25 enzymatic activity described in the Description of the Preferred Embodiments. The results of these 
measurements are shown in Table 4. The quantity of protein per liter of the culture broth was calculated on 
the basis of an assumed BLase specific activity of 2500 U/mg. 

n Q Table 4 



35 



Cultivation 
period (days) 



BLase 
activity 
(units /ml) 



Protein 
content 
(mg/L) 



1, 
2 
3 
4 
5 



15 
56 
72 
79 
69 



6.0 
22.4 
28.8 
31.6 
27.6 



As described above, the present invention provides a new protease which specifically cleaves the 
so peptide bonds at the carboxyl termini of glutamic acid residues in the amino acid sequences of polypep- 
tides, a method for the preparation of the protease from bacteria of genus Bacillus , a DNA sequence 
encoding the protease, an expression vector containing the DNA sequence, a transformant obtained by 
introduction of the expression vector into a host, and a method for the production of the protease using the 
transformant. This type of protease can be utilized for a variety of purposes, such as protein analysis and 
55 cleavage of the peptide chains of fusion proteins at desired sites, etc. 
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SEQ ID NO: 1 

SEQUENCE TYPE: Nucleotide with corresponding protein 
SEQUENCE LENGTH: 1448 base pairs 

5 STRANDEDNESS: double 

TOPOLOGY: linear 
MOLECULE TYPE: genomic DNA 

ORIGINAL SOURCE 

ORGANISM: Bacillus 1 icheni f ormi s 
10 STRAIN: ATCC NO. 14580 

FEATURES: 

from 323 to 1270 bp CDS(E) 

from 323 to 604 bp signal peptide(E) 

from 605 to 1270 bp mature peptide(E) 

75 

OTHER INFORMATION: 

x aa at position of amino acid sequence: formyl methionine 



TCGACGGCTT CCCGTGCGCC TCCGGGATCG CTGTGATAAT TGACAACCAC ATTCATCTTT 60 
TCTTTTCCAA ACCGTTCTGC AACCGCCTTG CCTATACCTT TTGAAGAGCC GGTCACAATT 120 
GCTGTTTTTC CTTTTAAATC ACTATACAAC CTAAACACCC CTCAATTTCT TTTCTCCATG 180 
TACATTACCC GGTATCAATA TATGATCAAA CAAAATG T T A ATACACACCT TTAGTATGAT 240 
CTTTTTTAAA CATATGGAAA ATTCAGAATT ATTTTGTTAA TATCTAACTT GTACTTACAA 300 
CAAAATAAGG AAGTGATATG AT TTG GTT AGT AAA AAG AGT GTT AAA CGA GGT 352 

Xaa Val Ser Lys Lys Ser Val Lys Arg Gly 
-94 .-90 -95 

TTG ATC ACA GGT CTC ATT GGT ATT TCT ATT TAT TCT TTA GGT ATG CAC - 400 

Leu lie Thx Gly Leu lie GJLy lie Ser lie Tyr Ser Leu Gly Met His 

-90 -85 -80 

CCG GCC CAA GCC GCG CCA TCG CCT CAT ACT CCT GTT TCA AGC GAT CCT 448 
Pro Ala Gin Ala Ala Pro Ser Pro His Thr Pro Val Ser Ser Asp Pro 
-75 -70 -65 



35 



50 



55 
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40 



TCA TAC AAA GCG GAA ACA TCG GTT ACT TAT GAC CCA AAC ATT AAG AGC 49 6 

Ser Tyr Lys Ala Glu Thr Ser Val Thr Tyr Asp Pro Asn lie Lys Ser 

-60 -55 -50 

GAT CAA TAC GGC TTG TAT TCA AAA GCG TTT ACA GGC ACC GGC AAA GTG 544 
Asp Gin Tyr Gly Leu Tyr Ser Lys Ala Phe Thr Gly Thr Gly Lys Val 

-45 -40 -35 

AAT GAA ACA AAG GAA AAA GCG GAA AAA AAG TCA CCC GCC AAA GCT CCT 592 
Asn Glu Thr Lys Glu Lys Ala Glu Lys Lys Ser Pro Ala Lys Ala Pro 
-30 -25 -20 -15 

TAC AGC ATT AAA TCG GTG ATT GGT TCT GAT GAT CGG ACA AGG GTC ACC 640 
Tyr Ser lie Lys Ser Val lie Gly Ser Asp Asp Arg Thr Arg Val Thr 

-11 5 10 

AAC ACA ACC GCA TAT CCG TAC AGA GCG ATC GTT CAT ATT TCA AGC AGC 688 
Asn Thr Thr Ala Tyr Pro Tyr Arg Ala lie Val His lie Ser Ser Ser 

15 20 25 

ATC GGT TCA TGC ACC GGA TGG ATG ATC GGT CCG AAA ACC GTC GCA ACA 736 
He Gly Ser Cys Thr Gly Trp Met lie Gly Pro Lys Thr Val Ala Thr 

30 35 40 . 

3CC GGA CAC TGC ATC TAT GAC ACA TCA AGC GGT TCA TTT GCC GGT ACA 784 
Ala Gly His Cys lie Tyr Asp Thr Ser Ser Gly Ser Phe Ala Gly Thr 
45 50 55 60 

GCC ACT GTT TC6 CCG GGA CGG AAC GGG ACA AGC TAT CCT TAC GGC TCA 832 
Ala Thr Val Ser Pro Gly Arg Asn Gly Thr Ser Tyr Pro Tyr Gly Ser 
65 70 75 



45 



50 



55 
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20 



30 



35 



40 



GTT AAA TCG ACG CGC TAC TTT ATT CCG TCA GGA TGG AGA AGC GGA AAC 880 
Val Lys Ser Thr Arg Tyr Phe lie Pro Ser Gly Trp Arg Ser Gly Asn 

80 85 90 

ACC AAT TAC GAT TAC GGC GCA ATC GAA CTA AGC GAA CCG ATC GGC AAT 928 
Thr Asn Tyr Asp Tyr Gly Ala lie Glu Leu Ser Glu Pro lie Gly Asn 

95 100 105 

ACT GTC GGA TAC TTC GGA TAC TCG TAC ACT ACT TCA TCA CTT GTT GGG 976 
"^hr Val Gly Tyr Phe Gly Tyr Ser Tyr Thr Thr Ser Ser Leu Val Gly 
110 H5 120 

iCA ACT GTT ACC ATC AGC GGC TAC CCA GGC GAT AAA ACA GCA GGC ACA 1024 
Thr Thr Val Thr lie Ser Gly Tyr Pro Gly Asp Lys Thr Ala Gly Thr 
125 130 135 140 

CAA TGG CAG CAT TCA GGA CCG ATT GCC ATC TCC GAA ACG TAT AAA TTG 1072 
Gin Trp Gin His Ser Gly Pro He Ala He Ser Glu Thr Tyr Lys Leu 

145 150 155 

CAG TAC GCA ATG GAC ACG TAC GGA GGA CAA AGC GGT TCA CCG GTA TTC' 1120 
Gin Tyr Ala Met Asp Thr Tyr Gly Gly Gin Ser Gly 6er Pro Val Phe 

160 165 170 

GAA CAA AGC AGC TCC AGA ACG AAC TGC AGC GGT CCG TGC TCG CTT GCC 1168 
Glu Gin Ser Ser Ser Arg Thr Asn Cys Ser Gly Pro Cys Ser Leu Ala 

175 180 185 

GTA CAC ACA AAT GGA GTA TAC GGC GGC TCC TCG TAC AAC AGA GGC ACC. 1216 
Val His Thr Asn Gly Val Tyr Gly Gly Ser Ser Tyr Asn Arg Gly Thr 
190 195 200 



CGG ATT ACA AAA GAG GTG TTC GAC AAT TTG ACC AAC TGG AAA AAC AGC 1264 
45 Arg lie Thr Lys Glu Val Phe Asp Asn Leu Thr Asn Trp Lys Asn Ser 

205 210 215 220. 

GCA CAA TAATACACGA AGACAGCCCG CTTCCTTTTG GAACGGGCTG TCACATCTAA 1320 
50 Ala Gin 

CGGCCGTATA CTTAATTTCC TTTAAGCCTG TACTTTTTGC CATCTATTGA TATCGTGAAA 1380 
TTTGAAGGAC CGCTGATCGG CAAAT AAT A G ACAAGCTGAA ACTCCGCTTC CTCACCAGGT 1440 
TTGAATGG 1448 
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SEQ ID NO: 2 

SEQUENCE TYPE: nucleic acid 
SEQUENCE LENGTH: 23 base pairs 

STRAND EDNESS : single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid. Synthetic DNA 
ACYAACACYA CYGCTTACCC RTA 
SEQ ID NO: 3 

SEQUENCE TYPE: nucleic acid 
SEQUENCE LENGTH: 18 base pairs 

STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid, Synthetic DNA 
YTTRTCKCCM GGATAKCC 
SEQ ID NO: 4 

SEQUENCE TYPE: nucleic acid 
SEQUENCE LENGTH: 17 base pairs 

STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid, Synthetic DNA 
CAGGAAACAG CTATGAC 
SEQ ID NO: 5 

SEQUENCE TYPE: nucleic acid 
SEQUENCE LENGTH: 18 base pairs 

STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid, Synthetic DNA 
TGTCCCAACA AGTGATGA 
SEQ ID NO: 6 

SEQUENCE TYPE: nucleic acid 
SEQUENCE LENGTH: 18 base pairs 

STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid, Synthetic DNA 
AAAACCGTCG CAACAGCC 
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SEQ ID NO: 7 

SEQUENCE TYPE: nucleic acid 
SEQUENCE LENGTH: 17 base pairs 

STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid. Synthetic 
GTTTTCCCAG TCACGAC 
SEQ ID NO: 8 

SEQUENCE TYPE: amino acid 
SEQUENCE LENGTH: 6 amino acids 

TOPOLOGY : 1 inear 

MOLECULE TYPE: peptide 

FRAGMENT TYPE: internal fragment 

ORIGINAL SOURCE 

ORGANISM: Bacillus 1 icheni f ormi s 
STRAIN: ATCC No. 14580 

Gly Tyr Pro Gly Asp Lys 
SEQ ID NO: 9 

SEQUENCE TYPE: amino acid 
SEQUENCE LENGTH: 5 amino acids 

TOPOLOGY : linear 

MOLECULE TYPE: peptide 

FRAGMENT TYPE: internal fragment 

ORIGINAL SOURCE: 

ORGANISM: Bacillus licheniformis 
STRAIN: ATCC No. 14580 

Ala lie Val His lie 
1 5 

SEQ ID NO: 10 

SEQUENCE TYPE: amino acid 

SEQUENCE LENGTH: 8 amino acids 

TOPOLOGY: linear 

MOLECULE TYPE: peptide 

FRAGMENT TYPE: internal fragment 
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ORIGINAL SOURCE: 

ORGANISM: Bacillus lichenlf ormis 
STRAIN: ATCC No. 14580 

Ser Thr Arg Tyr Phe lie Pro Ser 
1 5 

SEQ ID NO: 11 

SEQUENCE TYPE: nucleic acid 
SEQUENCE LENGTH: 27 base pairs 

STRAND ED NESS : single 
TOPOLOGY : linear 

MOLECULE TYPE: Other nucleic acid, Synthetic DNA 
GAGTCTAGAG CAGCTGCACA ATAATAG 27 
SEQ ID NO: 12 

SEQUENCE TYPE: nucleic acid 
SEQUENCE LENGTH: 26 base pairs 

STRANDEDNESS : single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid. Synthetic DNA 
GAGAAGCTTG ACAGAGAACA GAGAAG 26 



SEQ ID NO: 13 

SEQUENCE TYPE: nucleic acid 
SEQUENCE LENGTH: 27 base pairs 

STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid, Synthetic DNA 
CAAGAATTCG GCTTCCCGTG CGCCTCC 27 
SEQ ID NO: 14 

SEQUENCE TYPE: nucleic acid 
SEQUENCE LENGTH: 27 base pairs 

STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid, Synthetic DNA 
TTGTCTAGAA TTTGCCGATC AGCGGTC 27 



Claims 

1. A protease, derived from Bacillus licheniformis , which cleaves the peptide bonds at the carboxyl termini 
of glutamic acid residues in polypeptides, and which contains an amino acid sequence from serine in 
the + 1 position to glutamine in the + 222 position of SEQ ID NO: 1 . 

2. A protease of claim 1 , derived from Bacillus licheniformis ATCC No. 1 4580. 

3. A protease of claim 1 , which has the following properties: 
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(1 ) Optimal pH: approximately 8.0, and 

(2) Stable pH range: pH 6.5-8.5 at 25* C. 

4. A protease which contains an amino acid sequence from serine in the + 1 position to glutamine in the 
5 +222 position of SEQ ID NO: 1, and cleaves the peptide bonds at the carboxyl termini of glutamic acid 

residues in polypeptides. 

5. A DNA sequence encoding a protease of claim 1 or claim 4. 

w 6. A DNA sequence of claim 5, which contains a base sequence from the thymine residue in the 605 
position to the adenosine residue in the 1 270 position of SEQ ID NO: 1 . 

7. A DNA sequence of claim 5, which encodes a protease containing an amino acid sequence from N- 
formylmethionine at the -94 position to the glutamine at the +222 position of SEQ ID NO: 1 . 

75 

8. A DNA sequence of claim 7, which contains a base sequence, from the thymine residue in the 323 
position to the adenosine residue in the 1270 position, of SEQ ID NO: 1. 

9. An expression vector containing a DNA sequence of claim 5. 

20 

10. An expression vector of claim 9, which is expressible in bacteria of the genus Bacillus . 

11. A transformant obtainable by introducing the expression vector of claim 9 into a host. 

25 12. A transformant of claim 10, wherein said host is a strain belonging to the genus Bacillus . 

13. A method for producing a protease of claim 1 or 4, comprising the steps of cultivating a strain of 
Bacillus licheniformis capable of producing a protease of claim 1 or 4 in a culture medium and 
recovering the produced protease from the culture medium. 

30 

14. A method for producing a protease comprising the steps of cultivating a transformant of claim 11 in a 
culture medium and recovering the produced protease from the culture medium. 

PatentansprUche 

35 

1. Protease, abgeleitet von Bacillus licheniformis , welche die Peptidbindungen in Polypeptiden an den 
Carboxy-Termini von GlutaminsSureresten spaltet, und welche eine Aminos&uresequenz von Serin an 
der Position +1 bis Glutamin an der Position +222 der SEQ ID NO:1 enthSlt. 

40 2. Protease nach Anspruch 1 , abgeleitet von Bacillus licheniformis ATCC Nr. 1 4580. 

3. Protease nach Anspruch 1 , welche die folgenden Eigenschaften aufweist: 

(1 ) Optimaler pH-Wert: ungefahr 8,0, und 

(2) Stabiler pH-Bereich: pH-Wert 6,5-8,5 bei 25 *C. 

45 

4. Protease, die eine AminosSuresequenz von Serin an der Position +1 bis Glutamin an der Position 
+ 222 der SEQ ID NO: 1 enthSIt, und die die Peptidbindungen in Polypeptiden an den Carboxy-Termini 
von GlutaminsSureresten spaltet. 

so 5. DNA-Sequenz, die fu*r eine Protease gemafl Anspruch 1 oder Anspruch 4 kodiert. 

6. DNA-Sequenz nach Anspruch 5, die eine Basensequenz vom Thyminrest an der Position 605 bis zum 
Adenosinrest an der Position 1270 der SEQ ID NO: 1 enthSIt 

55 7. DNA-Sequenz nach Anspruch 5, die fOr eine Protease kodiert, welche eine AminosSuresequenz von N- 
Formylmethionin an der Position -94 bis zum Glutamin an der Position +222 der SEQ ID NO: 1 enthalt. 
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8. DNA-Sequenz nach Anspruch 7, die eine Basensequenz vom Thyminrest an der Position 323 bis zum 
Adenosinrest an der Position 1270 der SEQ ID NO: 1 enthSlt. 

9. Expressionsvektor, der eine DNA-Sequenz gemaB Anspruch 5 enthSlt. 

5 

10. Expressionsvektor nach Anspruch 9, welcher in Bakterien der Gattung Bacillus exprimierbar ist. 

11. Transformante, erhaltlich durch EinfUhrung des Expressionsvektors gemaB Anspruch 9 in einen Wirt. 
io 12. Transformante nach Anspruch 10, wobei der Wirt ein Stamm ist, der zur Gattung Bacillus gehGrt. 

13. Verfahren zur Herstellung einer Protease gemaB Anspruch 1 oder 4, umfassend die Schritte des 
Kultivierens eines Stammes von Bacillus licheniformis , welcher in der Lage ist, eine Protease gemaB 
Anspruch 1 oder 4 zu produzieren, in einem Kulturmedium, und des Gewinnens der produzierten 

75 Protease aus dem Kulturmedium. 

14. Verfahren zur Herstellung einer Protease, umfassend die Schritte des Kultivierens einer Transformante 
gem^S Anspruch 11 in einem Kulturmedium, und des Gewinnens der produzierten Protease aus dem 
Kulturmedium. 

20 

Revendlcations 

1. Une protease, obtenue a partir de Bacillus licheniformis , qui clive les liaisons peptidiques au niveau de 
Pextre'mite' carboxylique des r^sidus acide glutamique dans les polypeptides, et qui cOntient une 

25 sequence aminoacide a partir d'une serine a la position +1 jusqu'a une glutamine a la position +222 

de SEQ ID NO: 1. 

2. Une protease selon la revendication 1 , obtenue a partir de Bacillus licheniformis ATCC No. 14580. 

30 3. Une protease selon la revendication 1 , qui a les propri6t6s suivantes : 

(1) pH optimal : environ 8,0, et 

(2) zone de stability du pH : pH 6,5 - 8,5 a 25 * C. 

4. Une protease qui contient une sequence aminoacide a partir d'une serine en position + 1 jusqu'a une 
35 glutamine en position +222 de SEQ ID NO: 1, et qui clive les liaisons peptidiques au niveau de 

Pextr^mite' carboxylique des r^sidus acide glutamique dans les polypeptides. 

5. Une sequence ADN codant pour une protease selon la revendication 1 ou selon la revendication 4. 

40 6. Une sequence ADN selon la revendication 5, qui contient une sequence en bases a partir du re*sidu 
thymine a la position 605 jusqu'au residu adenosine a la position 1270 de SEQ ID NO: 1. 

7. Une sequence ADN selon la revendication 5, qui code pour une protease contenant une sequence 
aminoacide a partir d'une N-formylm^thionine a la position -94 jusqu'a une glutamine a la position 

45 +222 de la SEQ ID NO: 1. 

8. Une sequence ADN selon la revendication 7, qui contient une sequence en bases, a partir du residu 
thymine a la position 323 jusqu'au r£sidu adenosine a la position 1270, de SEQ ID NO: 1. 

so 9. Un vecteur d'expression contenant une sequence ADN selon la revendication 5. 

10. Un vecteur d'expression selon la revendication 9, qui est exprimable dans des bacteVies du genre 
Bacillus . 

55 11. Un transformant que I'on peut obtenir en introduisant le vecteur d'expression de la revendication 9 dans 
un note. 
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12. Un transformant selon la revendication 10, pour lequel ledit note est une souche appartenant au genre 
Bacillus . 

13. Un procede de production d'une protease selon la revendication 1 ou 4, comprenant les Stapes de 
s culture d'une souche de Bacillus licheniformis capable de produire une protease selon la revendication 

1 ou 4 dans un milieu de culture et de recuperation de la protease produite a partir du milieu de 
culture. 

14. Un procede de production d'une protease comprenant les Stapes de culture d'un transformant selon la 
70 revendication 11 dans un milieu de culture et de recuperation de la protease produite a partir du milieu 

de culture. 
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F: 



1 T CGA CGG CTT CCC GTG CGC CTC C OG GAT CGC TGT GAT AAT TGA CAA 

Sense pnmer CO 

47 CCA CAT TCA TCT TTT CTT TTC CAA ACC GTT CTG CAA CCG CCT TGC CTA 

9 5 TAC CTT TTG AAG AGC CGG TCA CAA TTG CTG TTT TTC CTT TTA AAT CAC 

14 3 TAT ACA ACC TAA ACA CCC CTC AAT TTC TTT TCT CCA. TGT ACA TTA CCC 

191 GGT ATC AAT ATA TGA TCA AAC AAA ATG TTA ATA CAC ACC TTT AGT ATG 

2 39 ATC TTT TTT AAA CAT ATG GAA AAT TCA GAA TTA TTT TGT TAA TAT CTA 

2B7 ACT TGT ACT TAC AAC AAA ATA AGG AAG TGA TAT GAT TTG GTT AGT AAA 
1 frtet val Ser Lys 

33 5 AAG AGT GTT AAA CGA GGT TTG ATC ACA GGT CTC ATT GGT ATT TCT ATT 

(-80) 

5 Lys Ser Val Lys Arg Gly Leu He Thr Gly Leu He Gly He Ser lie 

36 3 TAT TCT TTA GGT ATG CAC CCG GCC CAA GCC GCG CCA TCG CCT CAT ACT 
21 Tyr Ser Leu Gly Met His Pro Ala Gin. Ala Ala Pro Ser Pro Kis Thr 

431 CCT GTT TCA AGC GAT CCT TCA TAC AAA GCG GAA ACA TCG GTT ACT TAT 

(-50) 

. 3 7 Pro Val Ser Ser Asp Pro Ser Tyr Lys Ala Glu Thr Ser Val Thr Tyr 

47 9 GAC CCA AAC ATT AAG AGC GAT CAA TAC GGC TTG TAT TCA AAA GCG TTT 
C-*«0 (-30) 
5 3 Asp Pro Asn He Lys Ser- Asp Gin Tyr Gly Leu Tyr Ser Lys Ala Phe 

52 7 ACA GGC ACC GGC AAA GTG AAT GAA ACA AAG GAA AAA GCG GAA AAA AAG 
59 Thr Gly Thr Gly Lys Val Asn Glu Thr Lys Glu Lys Ala Glu Lys* Lys 
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F^.I-2 

575 TCA CCC CCC AAA GCT CCT 7AC ACC ATT AAA TCG CTG ATT GGT TCT GAT 
(-ID) (-U I 

e5 Ser Pro Ala Lys Ala Pro Tyr Scr lie Lys Ser Val lie Gly Ser Asp 

623 GAT CGG ACA AGG GTC ACC AAC ACA ACC GCA TAT CCG TAC AGA GCG ATC 
101 Asp Arg Thr Arg Val Thr Asn Thr Thr Ala Tyr Pro Tyr Arg Ala He . 

671 GTT CAT ATT TCA AGC AGC ATC GGT TCA TGC ACC GGA TGG ATG ATC GGT 
117 Val His He Ser Ser Ser He Gly Ser Cys Thr Gly Trp Met lie Gly 

719 CCG AAA ACC GTC GCA ACA GCC GGA CAC TGC ATC TAT GAC ACA TCA AGC 
133 Pro Lys Thr Val Ala Thr Ala Gly His .Cys He Tyr Asp Thr Ser Ser 

7 67 GGT TCA TTT GCC GGT ACA GCC ACT GTT TCG CCG GGA CGG AAC GGG ACA 

60 .70 
14 9 Gly Ser Phe Ala Gly Thr Ala Thr Val Ser Pro Gly Arg Asn Gly' Thr 

615 AGC TAT CCT TAC GGC TCA • GTT AAA TCG ACG CGC TAC TTT ATT CCG TCA 

80 

165 Ser Tyr Pro Tyr Gly Ser Val Lys Ser Thr Arg Tyr Phe He Pro Ser 

863 GGA TGG AGA AGC GGA AAC ACC AAT TAC GAT TAC GGC GCA ATC GAA CTA 
1B1 Gly Trp Arg Ser Gly Asn Thr Asn Tyr Asp Tyr Gly Ala He Glu Leu 

911 AGC GAA CCG ATC GGC AAT ACT GTC GGA TAC TTC GGA TAC TCG TAC ACT . 

110 

16 7 Ser Glu Pro He Gly Asn Thr Val Gly Tyr Phe Gly Tyr Ser Tyr Thr , 

95 9 ACT TCA TCA CTT GTT GGG ACA ACT GTT ACC ATC AGC GGC TAC CCA GGC 

*I20 ' &IZ5 '30 * 

213 Thr Ser Ser Leu Val Gly Thr Thr Val Thr lie Ser Gly Tyr Pro Gly 

1007 GAT AAA ACA GCA GGC ACA CAA TGG CAG CAT TCA GGA CCG ATT GCC ATC 
BL83 14-0 150 

229 Asp Lys. Thr Ala Gly Thr Gin Trp Gin His Ser Gly Pro lie Ala He 

1055 • TCC GAA ACG TAT AAA TTG CAG TAC GCA ATG GAC ACG TAC GGA GGA CAA 

IbO 

2*5 Ser Glu Thr Tyr Lys Leu Gin Tyr Ala Met Asp Thr Tyr Gly Gly Glri 

1103 AGC GGT TCA CCG GTA' TTC GAA CAA AGC AGC TCC AGA ACG AAC TGC AGC 
190 190 
261 Ser Gly Ser Pro Val Phe Glu Gin Ser Ser Ser Arg Thr Asn Cys Ser 

1151 GGT CCG TGC TCG CTT GCC GTA CAC ACA AAT GGA GTA TAC GGC GGC TCC 

190 

277 Gly Pro Cys Ser Leu Ala Val His Thr Asn Gly Val Tyr Gly Gly Ser 
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Fi ? !l-3 



1199 TCG TAC AAC AGA GGC ACC CGG ATT ACA AAA GAG GTG TTC GAC AAT TTG 
200 210 
293 ser Tyr Asn Arg Gly Thr Arg lie Thr Lys Glu Val Phe Asp Asn Leu 



124 7 ACC AAC TGG AAA AAC AGC GCA CAA TAA TAC ACG A AG ACA GCC CGC TTC 

.220 222 =r : 7 

309 Thr Asn Trp Lys Asn Ser Ala Gin *** lermmaror 



129 5- CTT TTG GAA CGG GCT GTC ACA TCT AAC GGC CGT ATA CTT AAT TTC CTT 

-> • i : 

134 3 TAA GCC TGT ACT TTT TGC CAT CTA TTG ATA TCG TGA AAT TTG AAG GAC 
1391 CGC TGA TCG GCA AAT AAT AGA CAA GCT GAA ACT CCG CTT CCT CAC CAG 

/tattsense primer CP> 

1439 GTT TGA ATG G 
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Fi 



genome, DMA °t 
Bacillus iicheniformis ATCCK580 



Hind Hi 




EcoRl 



Sense primer- (c) 
EcoRl 



BLase 



Xbal 

AvH sense, privwer (P) 



EcoRl Xbal 
I Worker \S\t^a\ | Mafureltt I | 

l EcoR! 
r Xbal 



EcoRl ® Xbal 



T4DNA ItQase. t 

Alkcxlmc, protege, i^miirva+Dr flleriv-e^ -yirom 
,Tcrvnmft+pr ) Bacillus subtilis ATCC6051 

Hind HI /Xbal ^TerwvW+or^ 
Mature, 




EcoRl 



Bacillus lichenHormts 
ATCCU580 BLase 
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GrenDmt DMA of 
Bacillus subtil is ATCC6051 



Hind III 




Xbal 
Hind 111 



Center pnvy»er (A) 
Xbal 



| PCR 



Hind III 
$mS$e*je primer CB) 



Xbal 



r Hind III 



Xbal 
Hind 01 



Hindi 




T4DNA U 



Xbal ® 

S "Ter»vum4or ( 1 1 ) 



Hindffl 



±3 



Hind II Xbal 




EcoRI 
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